Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

A robust algorithm for separation of Chinese characters from line drawings

Identifieur interne : 002710 ( Main/Exploration ); précédent : 002709; suivant : 002711

A robust algorithm for separation of Chinese characters from line drawings

Auteurs : Liang-Hua Chen [République populaire de Chine] ; Jiing-Yuh Wang [République populaire de Chine] ; Hong-Yuan Liao [République populaire de Chine] ; Kuo-Chin Fan [République populaire de Chine]

Source :

RBID : ISTEX:B17A96634211979F6A91F70091C630F9EB4280AE

Abstract

Separating characters from graphics is an important step towards automatic document understanding. In this paper, we propose a robust algorithm to separate Chinese characters from graphics. Our approach is based on clustering the feature points in an image. Two remedy procedures are also proposed to solve the problems caused by the thinning process. This will obtain a better localization of feature points and improve the performance of the separation process. Using our algorithm, all Chinese characters can be separated from graphics without regard to the font style or orientation of the character. Furthermore, our algorithm can also handle the serious case where characters touch/cross lines. The proposed algorithm has been successfully tested on several kinds of line drawings, such as land register maps and form documents.

Url:
DOI: 10.1016/0262-8856(96)01081-5


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>A robust algorithm for separation of Chinese characters from line drawings</title>
<author>
<name sortKey="Chen, Liang Hua" sort="Chen, Liang Hua" uniqKey="Chen L" first="Liang-Hua" last="Chen">Liang-Hua Chen</name>
</author>
<author>
<name sortKey="Wang, Jiing Yuh" sort="Wang, Jiing Yuh" uniqKey="Wang J" first="Jiing-Yuh" last="Wang">Jiing-Yuh Wang</name>
</author>
<author>
<name sortKey="Liao, Hong Yuan" sort="Liao, Hong Yuan" uniqKey="Liao H" first="Hong-Yuan" last="Liao">Hong-Yuan Liao</name>
</author>
<author>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:B17A96634211979F6A91F70091C630F9EB4280AE</idno>
<date when="1996" year="1996">1996</date>
<idno type="doi">10.1016/0262-8856(96)01081-5</idno>
<idno type="url">https://api.istex.fr/document/B17A96634211979F6A91F70091C630F9EB4280AE/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000915</idno>
<idno type="wicri:Area/Istex/Curation">000905</idno>
<idno type="wicri:Area/Istex/Checkpoint">001B21</idno>
<idno type="wicri:doubleKey">0262-8856:1996:Chen L:a:robust:algorithm</idno>
<idno type="wicri:Area/Main/Merge">002854</idno>
<idno type="wicri:Area/Main/Curation">002710</idno>
<idno type="wicri:Area/Main/Exploration">002710</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">A robust algorithm for separation of Chinese characters from line drawings</title>
<author>
<name sortKey="Chen, Liang Hua" sort="Chen, Liang Hua" uniqKey="Chen L" first="Liang-Hua" last="Chen">Liang-Hua Chen</name>
<affiliation wicri:level="1">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Department of Computer Science and Information Engineering, Fu Jen University, HsinChuang, Taipei, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Wang, Jiing Yuh" sort="Wang, Jiing Yuh" uniqKey="Wang J" first="Jiing-Yuh" last="Wang">Jiing-Yuh Wang</name>
<affiliation wicri:level="1">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Institute of Information and Electrical Engineering, National Central University, Chung-Li, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Liao, Hong Yuan" sort="Liao, Hong Yuan" uniqKey="Liao H" first="Hong-Yuan" last="Liao">Hong-Yuan Liao</name>
<affiliation wicri:level="1">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Institute of Information Science, Academia Sinica, Taipei, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<affiliation wicri:level="1">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Institute of Information and Electrical Engineering, National Central University, Chung-Li, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Image and Vision Computing</title>
<title level="j" type="abbrev">IMAVIS</title>
<idno type="ISSN">0262-8856</idno>
<imprint>
<publisher>ELSEVIER</publisher>
<date type="published" when="1995">1995</date>
<biblScope unit="volume">14</biblScope>
<biblScope unit="issue">10</biblScope>
<biblScope unit="page" from="753">753</biblScope>
<biblScope unit="page" to="761">761</biblScope>
</imprint>
<idno type="ISSN">0262-8856</idno>
</series>
<idno type="istex">B17A96634211979F6A91F70091C630F9EB4280AE</idno>
<idno type="DOI">10.1016/0262-8856(96)01081-5</idno>
<idno type="PII">0262-8856(96)01081-5</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0262-8856</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Separating characters from graphics is an important step towards automatic document understanding. In this paper, we propose a robust algorithm to separate Chinese characters from graphics. Our approach is based on clustering the feature points in an image. Two remedy procedures are also proposed to solve the problems caused by the thinning process. This will obtain a better localization of feature points and improve the performance of the separation process. Using our algorithm, all Chinese characters can be separated from graphics without regard to the font style or orientation of the character. Furthermore, our algorithm can also handle the serious case where characters touch/cross lines. The proposed algorithm has been successfully tested on several kinds of line drawings, such as land register maps and form documents.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>République populaire de Chine</li>
</country>
</list>
<tree>
<country name="République populaire de Chine">
<noRegion>
<name sortKey="Chen, Liang Hua" sort="Chen, Liang Hua" uniqKey="Chen L" first="Liang-Hua" last="Chen">Liang-Hua Chen</name>
</noRegion>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<name sortKey="Liao, Hong Yuan" sort="Liao, Hong Yuan" uniqKey="Liao H" first="Hong-Yuan" last="Liao">Hong-Yuan Liao</name>
<name sortKey="Wang, Jiing Yuh" sort="Wang, Jiing Yuh" uniqKey="Wang J" first="Jiing-Yuh" last="Wang">Jiing-Yuh Wang</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002710 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002710 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:B17A96634211979F6A91F70091C630F9EB4280AE
   |texte=   A robust algorithm for separation of Chinese characters from line drawings
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024